# Human Preference Alignment

Qwen3 32B GPTQ Int4

Qwen3 is the latest generation of the Tongyi Qianwen series of large language models. This 32B-parameter version, quantized to Int4 with GPTQ, supports thinking-mode switching, multilingual processing, and tool invocation, with strong reasoning and dialogue capabilities.

Large Language Model

Qwen3 is the latest 8B-parameter version in the Tongyi Qianwen series of large language models. It supports seamless switching between thinking and non-thinking modes and offers strong reasoning, instruction-following, and agent capabilities.

Large Language Model

Summllama3.2 3B

A text summarization model initialized from Llama3.2-3B-Instruct and optimized with Direct Preference Optimization (DPO) on large-scale summarization feedback.

Large Language Model

Summllama3.1 8B

SummLlama3.1-8B is a text summarization model initialized from Llama3.1-8B-Instruct and optimized with Direct Preference Optimization (DPO) on large-scale summarization feedback. It excels in faithfulness, completeness, and conciseness.

Text Generation

Llama 3.1 Nemotron 70B Instruct HF

A custom large language model by NVIDIA, designed to enhance the usefulness of responses generated by LLMs to user queries.

Large Language Model

Transformers English

SummLlama3-8B is a text summarization model initialized from Llama3-8B-Instruct and optimized with DPO on large-scale summarization feedback. It performs well on faithfulness, completeness, and conciseness.

Text Generation

Causallm 14B DPO Alpha GGUF

A 14B-parameter causal language model optimized with DPO, supporting English and Chinese text generation.

Large Language Model

Supports Multiple Languages

Causallm 7B DPO Alpha GGUF

A 7B-parameter large language model based on the Llama 2 architecture and optimized with DPO, supporting Chinese and English text generation.

Large Language Model

Supports Multiple Languages

DISC-MedLLM is a domain-specific large language model for medical dialogue, developed by Fudan University's DISC Lab and built on Baichuan-13b-base. It is designed to provide high-quality health support services.

Large Language Model

Transformers Chinese

Eleuther Pythia6.9b Hh Sft

A causal language model based on the Pythia-6.9b foundation model, trained with supervised fine-tuning on Anthropic's hh-rlhf dataset.

Large Language Model

Transformers English

© 2025 AIbase